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1 Introduction 

For  data  whirl,  take  the  form  of  a two-way  contingency 
table,  many  authors  have  examined  models  other  than 
the  common  model  of  independence  of  two  classifica- 
tions. In  the  literature  are  quasi  independence  models, 
clustered  sampling  models,  intraclass  models,  and  the 
Uradley-Terry  model,  among  others.  In  order  to  esti- 
mate model  parameters,  iterative  methods  are  required, 
and  this  leads  to  the  problem  of  developing  efficient  al- 
gorithms. 

2 Quasi  Independence  Models 

If  there  are  cells  in  a contingency  table  which  a priori 
have  a zero  count,  and  there  are  cells  in  that  row  or 
column  which  have  non  zero  counts,  the  independence 
model  is  not  appropriate.  To  cope  with  this  problem  of 
so-called  structural  zeroes,  the  notion  of  quasi  indepen- 
dence is  useful.  In  a quasi  independence  model  the  row 
and  column  classifications  are  independent,  provided  the 
cells  with  a priori  zero  counts  are  ignored. 

Kxamples  given  in  the  literature  of  quasi  indepen- 
dence models  include  the  random  pairing  models  of 


l.arntz  and  Weisbcrg  (I97(>)  and  de  Jong,  Greig  and 
Madan  ( 198'!),  the  mover-stayer  model  as  discussed  by 
Morgan  and  Titterington  ( 1977)  and  the  model  of  Lemon 
and  Chatficld  (1971),  which  is  an  alternative  toa  Markov 
chain  model,  l.arntz  and  Weisberg’s  model  can  be  ob- 
tained from  Lemon  and  Chatlield’s  by  folding  along  the 
main  diagonal. 

These  models  are  all  for  data  which  take  the  form 
of  a contingency  table  with  entries  on  the  main  diagonal 
whirl,  are  zero  a priori,  with  the  exception  of  l.arntz 
and  Weisberg’s  model,  where  the  entries  on  or  below  the 
main  diagonal  are  all  zero  ra  priori. 

In  order  to  fit  these  models  to  data,  various  n.etli 
ods  have  been  proposed.  Iterative  proportional  fitting 
(IFF)  is  commonly  used.  IFF  requires  that  the  model 
be  log  linear,  which  is  not  the  case  for  de  Jong,  Greig 
and  Madan ’s  model.  The  Newlon-Rapl.son  method  con- 
verges quickly,  but  it  is  not  easy  to  implement  if  the 
Hessian  is  not  diagonal.  The  Hessian  is  diagonal  for  the 
luovi  r-stayer  model  and  de  Jong,  Greig  and  Madan’s 
random  pairing  model,  but  not  for  the  other  models. 
Fixed  point  iterations  have  been  used  by  a number  of 
authors,  but  these  can  be  slow  to  converge.  Brown 
(1974)  developed  a method  for  dealing  with  a prion  ze- 
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roes,  which  iterates  over  the  cells  which  are  known  to 
he  zero.  Brown’s  method  becomes  less  and  less  efficient 
with  an  increasing  number  of  zero  cells,  de  Jong,  Greig 
and  Madan  (1983)  developed  a method  for  fitting  their 
random  pairing  model  which  involves  a reparameterisa- 
tion,  then  fixed  point  iteration.  This  method  can  also 
be  adapted  to  fit  other  quasi  independence  models. 

It  can  be  shown  that  when  the  table  is  symmetric, 
the  parameter  estimates  obtained  for  Lemon  and  Chat- 
field’s  (1971)  model  are  identical  with  those  obtained 
from  fitting  the  mover-stayer  model.  This  means  that 
the  models  of  both  Lemon  and  Ghatfield,  and  Larntz 
and  Weisberg  may  be  fitted  by  symmetrising  the  data 
and  fitting  the  mover-stayer  model.  Thus  a readily  im- 
plementable  Newton-ltaphson  approach  is  available  for 
these  models. 

All  the  methods  so  far  discussed  estimate  a proba- 
bility distribution  and  involve  a multivariable  iteration. 
The  authors  have  developed  new  methods  for  fitting 
quasi  independence  models  which  require  the  solution  of 
a nonlinear  equation  in  a single  unknown.  This  equation 
is  readily  solved  using  Newton’s  method.  This  gives  fast, 
very  easily-implemented  methods.  No  programming  is 
required,  and  well-known  packages  such  as  Minitab  or  a 
spreadsheet  may  be  used  to  do  the  calculations. 
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3 Other  Models 

In  the  clustered  sampling  model  with  clusters  of  size  two, 
if  a number  of  clusters  are  observed,  and  each  member 
of  the  cluster  is  classified  according  to  some  character- 
istic, the  data  take  the  form  of  a two-way  contingency 
table.  Gohen  (1976)  lias  given  a method  which  requires 
a two-stage  iteration  procedure,  with  one  stage  being  the 
solution  of  a non  linear  equation.  Using  a reparameteri- 
sation  the  authors  were  able  to  reduce  the  computations 
required.  A two-stage  iteration  is  still  required,  but  only 
simple  expressions  need  to  be  evaluated  at  any  stage. 
This  work  is  reported  in  Scott  and  Wang  (1990). 

Attempts  were  made  to  develop  improved  methods 
for  intraclass  models  (see  Haber(1982))  and  the  Bradley- 
Terry  model,  without  success. 
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